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Abstract 



We present a new threshold phenomenon in data structure lower bounds where slightly reduced 
update times lead to exploding query times. Consider incremental connectivity, letting t u be the time to 
insert an edge and t q be the query time. For t u = ft(t q ), the problem is equivalent to the well-understood 
union-find problem: InsertEdge(s, t) can be implemented by Union(Find(s), FlND(i)). This gives 
worst-case time t u = t q = 0(\gn/ lg lg n) and amortized t u =t q = 0(a(n)). 

By contrast, we show that if t u = o(lgn/lglgn), the query time explodes to t q > n 1-0 ^ 1 '. In other 
words, if the data structure doesn't have time to find the roots of each disjoint set (tree) during edge 
insertion, there is no effective way to organize the information! 

For amortized complexity, we demonstrate a new inverse-Ackermann type trade-off in the regime 

tu = 0(t q ). 

A similar lower bound is given for fully dynamic connectivity, where an update time of o(lg n) forces 
the query time to be n 1 ^"^. This lower bound allows for amortization and Las Vegas randomization, 
and comes close to the known 0(lg n ■ (lg lg n)°^ ) upper bound. 



1 Introduction 



We present a new threshold phenomenon in data structure lower bounds where slightly reduced update times 
lead to exploding query times. Previous trade-offs where smooth and much weaker. The new explosive lower 
bounds are found hidden in some very well-studied problems: incremental and fully-dynamic connectivity. 

1.1 Our Results 

The union-find problem is to support the following operations on a collection of disjoint sets, starting from 
n singleton sets {1}, . . . , {n}: 

Find(u) : Return an element in the same set as v that uniquely identifies the set. (This is called the root, or 

the representative of the set.) 
Union (u, v) : Join the sets identified by u and v, assuming these are roots of their own sets. 

The terminology for this problem stems from the usual implementation as a forest, in which each tree 
represents a set. Find involves walking to the root of u's tree, potentially doing useful work (path compres- 
sion). Union simply involves adding an edge between the roots (whose direction is usually determined by 
the size of each subtree, cf. union by rank). 

The union-find problem has been studied into excruciating detail and is now essentially understood. 
From an amortized perspective, Tarjan HTar751 showed that a sequence of n — 1 unions and m finds can 
be supported in time 0{n + ma(m, n)). See [TvL84, Pou90] for different analyses and trade-offs between 
amortized running times. From a worst-case perspective, the classic union-by-rank gives union in constant 
time and find in O(logn) time. Trade-offs were addressed by Blum [Blu86|, with an improvement by 
Smid [Smi90|. They show that, if the time for union is bounded by ^union, Find can be supported in 
worst-case 0(lgn/lgtuNioN)- Finally, Alstrup et al. [ABAR99] showed that the amortized and worst-case 
trade-offs can be achieved simultaneously. These bounds are known to be optimal in the powerful cell-probe 
model (see below for a review of the lower bounds). 

Here we consider an obvious strengthening of the problem, where we allow: 

Link(u, v) : Join the sets containing u and v if these sets are different. 

The link-find problem is a natural way to solve one of the most basic graph problems: incremental 
connectivity. This is the problem of maintaining an undirected graph under edge insertions and connectivity 
queries. New edges may link arbitrary nodes, possibly introducing cycles. Two nodes are connected if they 
find the same identifier. 

We now consider the worst-case trade-offs for link-find and incremental connectivity. Since link-find 
solves incremental connectivity, we want upper-bounds are for link-find and lower-bounds for incremental 
connectivity. 

Let tLiNK be the link time and £find be the find time. In the regime £link > ^find, the problem can be 
solved by union-find, since we have time to find the roots of u and v and union them if they are different. 
Using the upper bounds for union-find, we obtain £find = 0(\gn/ \gt u ), and in particular the balanced 
point t LlNK = t F iND = 0(lgn/lglgn). 

If we insist on ^link = o(^find). union-find no longer suffices. In fact, we show a surprisingly abrupt 
trade-off that essentially signifies the "end of data structuring" even for incremental connectivity: 

Theorem 1. Any data structure for incremental connectivity over n vertices that supports edge insertions in 
worst-case time ^link = °( ig^T ra ) m ust have worst-case query time ^find > n 1 ""^ in the cell-probe model 
with cells ofO(lg n) bits. 
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By reduction, we get the same trade-off for link-find. This can be contrasted with the very smooth trade- 
off for union-find, £find = 0(lgn/lg ^union), £-g-» the standard union-by-rank with 0(l)-time UNION and 
(9(lgn)-time Find. Our result shows a remarkable dependence of edge insertion on the Find operation. As 
soon as a new link doesn't have time to locate the roots, the query degenerates into almost linear time. 

We will also analyze the amortized bounds for link-find, which are again weaker than those for union- 
find when t q > t u , but the bounds are less striking. 

We show a similar computational phenomenon for fully dynamic connectivity where both edge inser- 
tions and deletions. In this fully-dynamic case, we hit the wall even when we amortize. 

Theorem 2. Any data structure for fully dynamic connectivity in a graph of n vertices with update time t u — 
o(lg n) must have query time t q > n 1 ^ ^. This bound allows amortization and Las Vegas randomization 
(expected running times), and holds in the cell-probe model with cells ofO(\g n) bits. 

Thorup [ThoOO | has an almost matching upper bound of t u = 0(lg n ■ (lg lg n) 3 ) and t q = o(lg n). This 
data structure uses both Las Vegas randomization and amortization. 

Supporting both LINK and UNION. To fully appreciate the difficulty of finding roots, consider a data 
structure that supports both a traditional UNION update between roots and LINK between arbitrary nodes. 
We know from previous works that if UNION takes ^union time, then the best worst-case query time is 
0(logn/logtuNioN)- This holds both for find-root and connecitivity queries. We can always implement 
links with find-root and union in 0(logra/logiuNioN + ^union) time, and this preserves the query time. 
However, in the case where the query time dominates the union time, that is, iumoN = o(log nj logtuNioN)> 
we would like to avoid finding the roots, and get a query time closer to £tjnion- 

A similar phenomenon appeared in connection with union-find with deletions. Kaplan et al. [KST02b] 
considered this problem but wanted bounds where n represented the size of the actual tree(s) involved in an 
operation. All worst-case bounds are trivially local, and [KST02b] proved refined the standard amortized 
analysis to work locally, though the bound becomes a bit weird with the standard notation: a(n) is OK, 
but otherwise, it becomes a(n ■ \M/N] , n) amortized time per find where M and N are the global number 
of finds and unions, respectively. With the notation from ||ARG + Q5l . the local amortized find bound is 
0( a \M/N~\ ( n ))- They showed how to augment union-find with a delete operation if we when deleting an 
element x, first find the root and then perform a local rebuilding step in the tree that x is deleted from. For 
t u = 0(1), this gave them both find-root and delete in 0(log n) time. Similar to our case, they asked if the 
deletion time could be made better than this find time. For the deletions, the answer was yes. Alstrup et al. 
IAR G + 05ll proved that deletions could be supported locally in constant time without affecting the O(logn) 
bound on the query time. 

Back to our union-find with links problem, as in the deletions case, we would like to support LINK 
better than FIND without affecting the FIND bound. Here we show that any such positive result is totally 
impossible. If we try to beat the query time, supporting links in £link = o(log n/ log ^union) time, then the 
query time explodes from £find = 0{\ogn/ \ogt u ) to £find > n l ~°^ time. 

1.2 Lower Bounds 

Many of the early lower bounds for union-find were in (restricted versions of) the pointer machine model 
llTar79l lBan80l IPou96l lBlu86ll . 

In STOC'89, Fredman and Saks [FS89] were the first to show dynamic lower bounds in the cell-probe 
model. They studied the partial sums problem and the union-find problem. The partial sums problem asks 
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to maintain an array A[l . . n] under pointwise updates and queries for a prefix sum: Yli<k ^A- ^ or partial 
sums and for worst-case union-find, Fredman and Saks showed a lower bound of t q = 0(lg n/ lg(t u lg n)). 
For amortized union-find, they gave an optimal inverse-Ackermann lower bound. A different proof of the 
same bounds was given by Ben-Amram and Galil in FOCS'91 [B AGOli 

In STOC'99, Alstrup, Ben-Amram and Rauhe [ABAR99] improved the trade-off for union-find to 
t q = f2(lg n/ lg t u ), which was also the highest known trade-off for any problem. In STOC'02, Kaplan, 
Shafrir and Tarjan [KST02a| showed that the optimal worst-case and amortized trade-offs for union-find 
also hold for a weaker Boolean version where the user specifies set identifiers and where we only have 
membership queries. From a lower bound perspective, the tricky part is that the query output is a single bit. 
Identifiers can always be viewed as special elements of sets. Thus they get the same lower bound trade-off 
for incremental connectivity: edges are only added between current set identifiers, and connectivity queries 
are between arbitrary nodes and current set identifiers. This lower-bound trade-off for incremental connec- 
tivity is tight when t u = Q(t q ), matching the previously mentioned upper-bounds for link-find. However, 
by our Theorem [T] the incremental connectivity queries hit a wall when the update time becomes lower. 

The work of Patra§cu and Demaine from STOC'04 [PD06] gives the best trade-offs known today, 
for any explicit problem. They considered partial sums and fully dynamic connectivity, and showed 
that, if max{t u ,t q } = O(Blgn), then mm{t u ,t q } = U(log B n). In particular, their bounds implied 
max{i u , t q } = 0(lg n), whereas previous results implied max{t u , t q } = Q(lg n/ lg lg n). 

These bounds are easily seen to be optimal for the partial sums problem. The standard solution is to 
create an ordered binary tree with leaf set [n] ; each internal node maintains the sum of its children. Updates 
and queries are trivially supported in 0(logn) time. To get a trade-offs, we can instead use a .B-tree with 
degree B. The time of an update is the height of the tree, which is 0(log B n). However, to answer a query, 
we need to add up all left siblings from the path to the root, so the query time is 0{B \og B n). 

Our results significantly improve the known trade-offs in the regime of fast query times. Note that the 
previous strongest bounds from [PD06| could at most imply t q = $7(n e ) even for constant update time. 
Here e depends on the constant in the update time. For example, allowing only 4 cell probes for the updates, 
[PD06, careful inspection] gets a query lower bound of Sl(nTe ). Our Theorem [2] says for another problem 
that we with o(log n) probes get a query lower bound > ri 1-0 ^ 1 ) queries. 

The trade-offs of [PD06] are optimal in the full range for the partial sums problem. For incremental and 
fully dynamic connectivity, the previous mild trade-offs [KST02a, PD06] are optimal in the regime t u ^> t q ; 
it is only the regime of fast updates that causes the abrupt transitions in Theorems [T] and [2] 

Lower bounds beyond the balanced tree. The previous lower-bounds we discussed are essentially all 
showing that the we cannot do much better than maintaining information in a balanced tree. All operations 
follow well-understood paths to the roots. Trade-offs were obtained by increasing the degree, decreasing the 
height: the faster of updates and queries would just follow the path to the root while the slower would have 
to consider siblings on the way. The lower bounds from [PD06 1 are best possible in this regard. 

Our stronger trade-offs for incremental and fully-dynamic connectivity shows that there is no such sim- 
ple way of organizing information; that the links between arbitrary vertices changes the structure too much 
if the update times is not long enough, we cannot maintain the balanced information tree. 

2 Simulation by Communication Games Results 

Generally, for the data structure problems considered, we are going to find an input distribution that will 
make any deterministic algorithm perform badly on the average. This also implies expected lower bounds 
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for randomized algorithms. 

Consider an abstract dynamic problem with operations Update(uj) and QuERY((/j). Assume the se- 
quence of operations is of fixed length, and that the type of each operation (query versus update) is fixed 
a priori. The "input" u-i or qi of the operation is not fixed yet. Let I a and Ib be two adjacent intervals of 
operations, and assume that every input u« or qi outside of I a U Ib has been fixed. What remains free are the 
inputs Xa during interval I a and Xb during interval Ib- These inputs (Xa, Xb) follow a given distribution 
V. 

It is natural to convert this setting into a communication game between two players: Alice receives Xa, 
Bob receives Xb, and their goal is to answer the queries in Xb (which depend on the updates in Xa)- In 
our applications below, the queries will be Boolean, and it will even be hard for the players to compute the 
and of all queries in the Ib interval. Each player is deterministic, and the two players can exchange bits of 
information. The last bit communicated should be the final answer of the game, which here is the and of 
the queries in Ib- The complexity of the game is defined as the total communication (in bits) between the 
players, in expectation over V. 

We will work in the cell-probe model with w-hit cells; in the applications below, w = 0(lgn). For 
notational convenience, we assume the data structure must read a cell immediately before writing it (but it 
may choose to read a cell without rewriting it). Let Wa be the set of cells written during time interval I a, 
and Rb be the set of cells read during interval Ib- 

Lemma 3. For any p > 0, the communication game can be solved by a zero-error protocol with complexity 
Ex>[|Wa| • o(ig ~) + 0(w) ■ (\Wa n Rb\ +p\Rb\)]- 

Proof. Alice first simulates the data structure on the interval I a- The memory state at the beginning of I a 
is fixed. After this simulation Alice constructs a Bloom filter BBlo701 with error (false positive) probability 
p for the cells Wa- The hash functions needed by the Bloom filter can be chosen by public coins, which 
can later be fixed since we are working under a distribution. Alice's first message is the Bloom filter, which 
requires \Wa\ • 0(\g -) bits. 

Bob will now attempt to simulate the data structure on Ib- The algorithm may try to read a cell of the 
following types: 

• a cell previously written during Ib'- Bob already knows its contents. 

• a cell that is positive in the Bloom filter: Bob sends the address of the cell to Alice, who replies with 
its contents; this exchange takes 0(w) bits. 

• a cell that is negative in the Bloom filter: Bob knows for sure that the cell was not written during I a- 
Thus, he knows its contents, since it comes from the old fixed memory snapshot before the beginning 
of I A - 

With this simulation, Bob knows all the his answers and can transmit the final bit telling if they are all true. 
The number of messages from Bob is \ Wa ni?s| (true positives) plus an expected number of false positives 
of at most p\Rb\- □ 

We will use the simulation to obtain lower bounds for \Wa H Rb\, comparing the complexity of the 
protocol with a communication lower bound. This simulation works well when \Wa H Rb\ ~ \Wa U 
Rb\ / i^ig n , since we can use p « y^, and make the term \Wa H Rb \ dominate. Unfortunately, it does not 
work in the regime \Wa H Rb\ U i?s|/lgn, since one of the terms proportional to \Wa\ or \Rb\ 

will dominate, for any p. 

To give a tighter simulation, we use a stronger communication model: nondeterministic complexity. In 
this model, a prover sends a public proof Z to both Alice and Bob. Alice and Bob independently decide 
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whether to accept the message, and they can only accept if the output of the communication game is "true" 
(i.e. all queries in Ib return true). In this model Alice and Bob do not communicate with each other. Alice's 
answer is a deterministic function /a{Xa, Z) of her own input and the public proof. Similarly, we have 
Bob's answer /b{Xb, Z). For the protocol to be correct, Ja(Xa, Z) and /b(Xb, Z) may only both be true 
if this is the answer to the game. 

Our goal for the prover is to define a short public proof Z(Xa, Xb) that will lead Alice and Bob to the 
desired answer Ja{Xa, Z(Xa, Xb)) A /b(Xb, Z(Xa, Xb))- The complexity of the protocol is the of the 
game should be the and of all queries in Ib- Since we are working under a distribution, the bit length of 
the prover's message Z(Xa, Xb) is a random variable, and we define the complexity of the protocol as its 
expectation. 

Lemma 4. The communication game can be solved by a nondeterministic protocol with complexity 
E v [0(w)-\W A nR B \ + 0(\WAUR B \)]. 

Proof. We will use a retrieval dictionary (a.k.a. a Bloomier filter, or a dictionary without membership). 
Such a dictionary must store a set S from universe U with k bits of associated data per element of S. When 
queried for some 16S, the dictionary must retrieve x's associated data. When queried about x ^ S, it may 
return anything. One can construct retrieval dictionaries with space 0(k\S\ + lglg |?7|); see e.g. |DP08]. 

The message Z(Xa, Xb) of the prover will consist of the addresses and contents of the cells X = 
\Wa n Rb\, taking 0(w) bits each. In addition, he will provide a retrieval dictionary for the symmetric 
difference Wa^-Rb = (Wa \ Rb) U (Rb \ Wa)- In this dictionary, every element has one associated bit of 
data: zero if the cell is from Wa \Rb and one if from Rb\ Wa ■ The dictionary takes O (lg w + \ Wa U Rb | ) 
bits. 

Alice first simulates the data structure on I a- Then she verifies that all cells X were actually written 
(X C Wa), and their content is correct. Furthermore, she verifies that for all cells from Wa \X, the retrieval 
dictionary returns zero. If some of this fails, she rejects with a false. 

Bob simulates the data structure on Ib- The algorithm may read cells of the following types: 

• cells previously written during Ib- Bob knows their contents. 

• cells from X: Bob uses the contents from public proof (Alice verified these contents). 

• cells for which the retrieval dictionary returns one: Bob uses the contents from the fixed memory 
snapshot before the beginning of I a (Alice verified she didn't write such cells). 

• cells for which the retrieval dictionary return zero: Bob rejects. The prover is trying to cheat, since in 
a correct simulation all cells of Rb \ X has a one bit in the dictionary. 

If neither player rejects, we know that Rb \ X is disjoint from Wa \ X, so the simulation of Bob is correct. 
Finally Bob rejects if any of his answers are false. □ 

3 Lower Bound for Incremental Connectivity 

Theorem 5. Any data structure for incremental connectivity over n vertices that supports edge insertions 
between roots in worst-case time ^union = ( \^ig n ) and arbitrary edge insertions in worst-case time £link = 
°( lgfgn ) must have query time ^find > n 1 ^"^. 

Let e = o(l) be such that £union = o(e 2 lgn/ lg lgn). Define B = lg 2 n, C = n £ , and M = n 1_e . 
The starting point of our hard instance is essentially taken from Fredman and Saks' seminal pa- 
per HFS89II . The hard instance will randomly construct a forest of M trees. Each tree will be a perfect 
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tree of degree B and height log B (n/M). On layer of the forest we have the M roots. On layer i, we have 
exactly M ■ B l vertices with B % vertices from each tree. 

We can describe the edges between level i and i — 1 as a function : \M ■ B l ] — > [M ■ B 1-1 ] that 
is balanced: for each x € [MB ], \(fi) (x)\ = B. We will use the following convenient notation for 
composition: />j = /;o f i+1 o • • • . For example, the ancestor on level i — 1 of leaf x is f>i(x). 

Our hard instance will insert the edges describing //s in bottom-up fashion (i.e. by decreasing i, from 
the largest level up to the roots). We call "epoch i" the period of time when the edges f\ are inserted. Let W, 
(respectively Ri) be the cells written (respectively, read) in epoch i. Observe that | W% < M-I?^u NI0N . 
We will use the following convenient notation for set union: W<i = Uj<j Wj. The cells Wi \ W<i are those 
last written in epoch i. 

All the above edges where added in union-find style from roots of current trees, and indeed the above 
constitutes the hard case for union-find from [FS89 |. At this point BFS891 shows that finding a root from a 
random leaf would entail reading cells from most epochs in SI (log n/ log B) expected time. 

Our goal is to show that linking arbitrary vertices may lead to much more expensive queries. We will 
describe some very powerful metaqueries that combines links to roots and leaves with a few connectivity to 
reveal far more information than if we only had the regular connectivity queries. The metaqueries will be 
provably hard to answer, so if the links are done too quickly, the queries must be very slow. 

Our graph contains C additional special vertices, conceptually colored with the colors 1 . . C. Each 
colored vertex is connected to M/C nodes on level (the final roots of our trees). This is done in a fixed 
pattern: colored vertex 1 is connected to roots 1, . . . , M/C; colored vertex 2 to the next M/C roots; etc. 
These edges can be inserted at the very beginning of the execution, prior to any interesting updates. 

At the end of epoch 1 all trees are complete. In this state, we say the root color of a vertex is the color 
that its root is connected to. Conceptually, the hard distribution colors a random set Q of exactly M leaves 
and verifies that these are the root colors. 

To implement this test by incremental connectivity operations (Link), we first link each query leaf to 
the proposed colored vertex. Then, for i = 2 . . C, we query whether colored vertex i is connected to colored 
vertex i — 1, and then insert an edge between these two color nodes. The metaquery returns "true" iff all 
connectivity queries are negative. 

We claim that if the metaquery answers true, the coloring of Q must be consistent with the coloring 
of the roots. Indeed, if some leaf is colored i and its root is colored j ^ i, this inconsistency is caught at 
step max{i, j}. At this step, everything with color < max{i, j} — 1 has been connected into a tree, so the 
connectivity query will return true. 

Let x(Q) be the coloring of leaves in Q that matches their root colors. In the hard distribution, the 
metaquery always receives proposed colors from x(Q)-> so it should answer true. Nevertheless, the data 
structure will need to do a lot of work to verify this. Let E9 be the cells read during the metaqueiy. We 
have \R9\ < C ■ t q + 2M ■ £ijnion- The main claim of our proof is: 

Lemma 6. For any i G {1, . . . , \og B (n/M)}, we have B[\R Q n (W; \ W<i)|] = O(eM). 

Before we prove the lemma, we show that it implies our lower bound. The sets Wi \ W<i are dis- 
joint by construction, so ^2i^[\ RQ n ( w i \ w <i)W < E[|i? <9 )]. Remember that we have log s (n/M) = 
0(log(n e )/lglgn) = 0(elgn/lglgn) epochs. Thus E[|i?^[] = Q(M ■ e 2 lg n/ lg lgn). But we always 
have \R Q \ < C ■ t q + 2M ■ t u = Ct q + o(Mj^^), by choice of e. It follows that Ct q is the dominant term 
in E[|^|], so t q = n(Me 2 (lgn/lg lg n) /C) > n l ~ 2e . 
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Proof of Lemma |6l Fix i. We will prove the stronger statement that the lower bound holds no matter how 
we fix the edges outside epoch i (all /j's for j ^ i). 

To dominate the work of later epochs i — 1, . . . , 1, we consider B l i.i.d. metaqueries. Choose sets 
Q 1 , Q 2 , . . . , Q B% independently, each containing M uniformly chosen leaves. Starting from the memory 
state where all trees are completely built and the roots have been colored, we simulate each metaquery 
(Q- 7 , x(Q"0) i n isolation. We do not need to write any cells in this simulation, for the cell-probe model has 
unbounded state to remember intermediate results and in our hard distribution there is no operation after the 
metaquery. Thus the simulations of the different metaqueries do not influence each other. Let R* be the cells 
read by all B i metaqueries. By linearity of expectation, E[\R* n(Wi\W<i)|] < B { -E[\RQ D (Wi \W<i)|]. 

Let Q* = |J ■ QK Since we have fixed all />j, asking about the root color of a leaf q G Q* is equivalent 
to asking about the root color of node f>i(q) on level i. 

Claim 7. We have E[|/>i(Q*)|] > (1 - \)MB\ 

Proof. Each leaf x in some Qi is chosen uniformly, so its ancestor f<i(x) is also uniform. The M ■ B % trials 
are independent (for different Qi , Q k ), or positively correlated (inside the same Q- 7 , since the leaves must 
be distinct). Thus, we expect to collect (1 — \ je)MB % distinct ancestors. □ 

By the Markov bound \f>i(Q*)\ > \MB % with probability at least 1 - 2/e. Thus we may fix the 
sequence (Q 1 , Q 2 ,..., Q B1 ) to a value that achieves |/>i(Q*)| > \MB { while increasing E[|J2* n (W, \ 
W<i)\] by at most (1 - 2/e)" 1 = O(l). 

The only remaining randomness in our instance are the edges f\ from epoch i and the proposed colorings 
xiQ-i) given to each metaquery QK To be valid, these colorings are functions of /j, for as soon as we know 
fi, we know the whole forest including the root colors of all the leaves in the different QK The metaquery 
colors have to agree on common leaves, so they provide us a coloring x(Q*)- With fi yet unknown, we 
claim that x(Q*) has a lot of entropy: 

Claim 8. H(x(Q*)) = O(MSMgC). 

Proof. Let X be the unknown coloring of all vertices on level i. We claim it has entropy H(X) = MB 1 ■ 
log 2 C — 0{C lg n). We have not fixed anything impacting this coloring so X is a random balanced vector 
from [C] MB \ Indeed, any balanced coloring is equiprobable, because the coloring of the roots is balanced, 
all trees have the same sizes, and fi is a random balanced function. We claim that it has entropy H(X) = 
MB 1 ■ log 2 C — 0{C\gn). The number of balanced colorings is given by the multinomial coefficient 

(JUT DI v 
MB i /c MB i IC )• "'"his is the central multinomial coefficient, so it is the largest. It must therefore be at 

least a fraction (MB l )~ c > n~ c of the sum of all multinomial coefficients. This sum is C MB1 (the total 
number of possible colorings), so H(X) > log 2 (C MS Yn < - ; ) = MB % log 2 C — Clog 2 n. 

We argue that H(x(Q*)) = Vt(MB l lg C). Indeed, x(Q*) reveals the coloring of vertices /<j(Q*) on 
level i, which number at least ^MB l . Given x(Q*)» to encoding X, we just write all other colors explicitly 
using ^MB i log 2 C bits. Therefore H(x(Q*)) > H(X) - \MB i \og 2 C > MB l log 2 C - C\g 2 n - 
\MB i \og 2 C = Vt{MB i \gC). ' □ 

We consider the communication game in which Alice represents the time of epoch i (her private input 
is Xa = fi), and Bob represents the time of epochs i — 1, . . . , 1 and the metaqueries (his private input is 
Xb = x(Q*))- Their goal is to determine whether all the metaqueries return true. 

Claim 9. Any zero-error protocol must have average case bit complexity Q(MB l lg C). 
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Proof. We turn our attention to the communication game. The set of inputs of Alice and Bob that lead to a 
fixed transcript of the communication protocol forms a combinatorial rectangle. More precisely, a transcript 
t represents a sequence of transmissions between Alice and Bob. On Alice's side, there will be a certain set 
X\ of inputs making her follow t provided that Bob follows t, and we have a corresponding input set X B 
from Bob. Inputs Xa, and Xb will lead to t if and only if (Xa, Xb) £ X A X X B . Since the players must 
verify Xb = x(Q*) an d the protocol has zero error, the rectangle cannot contain two inputs of Bob with 
different x(Q*)> tnat i s > \%b\ = 1 f° r a ^ va hd Thus the transcript for a coloring x(Q*) * s unique with no 
smaller entropy. □ 

We will use Lemma|3]to obtain a communication protocol, setting the rate of false positives in the Bloom 
filter to p = 1/ lg n. The cells written in Alice's interval are precisely Wi\ the cells read in Bob's interval are 
-R<i U R* where R* is the union of the cells read by all the metaqueries. By Lemma |3l the communication 
complexity is: 

EfK^iUi^JnWil-OOgn) + Wi-0{\glgn) + ^\R Ki U R*\ ■ OQgn)] 
< E[\R*r\Wi\] -O(lgra) + 0(MB% -lglgn) + 0(MB i ^ 1 t u • lgn) + 0(\R*\) 

We compare this to the lower bound of Q,(MB l lgC) = Q(MB l - sign) from Claim [9] Remember that 
t u = o(e 2 lg n/ lg lg n), so the second term is o(MB l e 2 lg n), which is asymptotically lower than the lower 
bound. Also, we set B = lg 2 n, so the third term is o(MB l ). Finally, we have \R*\ = 0(B l Mt u ). To see 
this, recall that \R*\ < B l (Mt u + Ct q ), so if the statement was false, we would have B l Ct q = oj(B 1 M) and 
t q = uj(M/C) = uj(n l ~ 2e ). Since 0(B l Mt u ) is also low order term, the first term must dominate, which 
means E[|,R* n (W* \ W<i)|] = U^MB^). Therefore, E[\R* n (Wi \ W<i)|] = n(eM). This completes 
the proof of Lemma [6] from which we got our lower bound for incremental connectivity. 



4 Lower Bound for Dynamic Connectivity 

Theorem 10. Any data structure for dynamic connectivity in graphs of n vertices that has (amortized) 
update time t u = o(lg n) must have (amortized) query time t q > n l ~ o(yl \ 

Let e be such that t u = o(e 2 lgn), and define M = n x ~ e and C = n e . The shape of our graphs is 
depicted in Figured] The vertices are points of a grid [M] x [n/M\. The edges of our graph are matchings 
between consecutive columns. Let tt\, . . . , ir n /M-i De the permutations that describe these matchings. We 
let 7r<j = TTj o 7Tj_i o • • ■ o 7Ti. Node i in the first column is connected in column j + 1 to ir<j(i). 

The graph also contains C special vertices, which we imagine are colored with the colors 1, . . . , C. At 
all times, a colored vertex is connected to a fixed set of Mj C vertices in the first column. (For concreteness, 
colored vertex 1 is connected to vertices 1, . . . , M/C; colored vertex 2 to the next M/C vertices; etc.) 

We will allow two meta-operations on this graph: Update and Query. Initially, all permutations are 
the identity (i.e. all edges are horizontal). Update (j, TT new ) reconfigures the edges between columns j and 
j + 1: it sets TTj to the permutation TT new . This entails deleting M edges and inserting M edges, so Update 
takes time 2M • t u . 

Query (j, x) receives a vector x £ [C] , which it treats as a proposed coloring for vertices on column 
j. The goal of the query is to test whether this coloring is consistent with the coloring of the vertices in the 
first column. More specifically, a node % of color a in the first column must have xt^j'C*)] = a. A Query 
can be implemented efficiently by connectivity operations. First each vertex i in column j is connected to 
the colored vertex x[i]- Then, for i = 2 . . M, we run a connectivity query to test whether colored vertex i is 
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Figure 1: The shape of our graphs. 

connected to colored vertex i — 1. If so, Query return false. Otherwise, it inserts an edge between colored 
vertices i and i — 1 and moves to the next i. At the end, Query deletes all vertices it had inserted. The total 
cell-probe complexity of Query is O(M) ■ t u + C ■ t q . It is easy to observe that this procedure correctly 
tells whether the colorings are consistent (as in our instance of incremental connectivity). 

We will now describe the hard distribution over problem instances. We assume -jg — 1 is a power of 
two. Let a be the bit-reversal permutation on {0, . . . , — 2}: a(i) is the reversal of i, treated as a vector 
of log 2 (jj — 1) bits. For i = 0, . . . , jj — 1, we execute an Update to position j = a(i) + 1, and a Query 
to the same position j. The update sets itj to a new random permutation. The query always receives the 
consistent coloring, and should answer true. The total running time is 

T < n/M(2Mt u + 0{M)t u + Ct q ) = 0{nt u + (n/M)Ct q ). 

If we can prove a lower bound T = uj{nt u ), then this will yield a high lower bound for t q . 

For the lower bound proof, we consider a perfect ordered binary tree with n/M — 1. The leaves are 
associated with the pairs of Update and Query operations in time order. Let W(v) (respectively R(v )) 
be the set of cells written (respectively, read) while executing the operations in the subtree of v. Note that 
W(v) C R{v ), since we have assumed a cell must be read before it is written. Our main claim is: 

Lemma 11. Let v be a node with 2k leaves in its subtree, and let vl, vr be its left and right children. Then 
V[\W(v L )nR(v R )\ + T ^\W(v L )UR(v R )\]=n(k-eM). 

Before we prove the lemma, we use it to derive the desired lower bound. We claim that the total expected 
running time is T > J2 V E[|W(vl) n R(vr)\], where the sum is over all nodes in our lower bound tree. 
Consider how a fixed instance is executed by the data structure. We will charge each read operation to a 
node in the tree: the lowest common ancestor of the time when the instruction executes, and the time when 
the cell was last written. Thus, each W{vt) H R(vr) corresponds to (at least) one read instruction, so there 
is no double-counting in the sum. 

We now sum the lower bound of Lemma [TTI over all nodes; observe that k v = lg jj), since 
the tree has n/M - 1 leaves. We obtain ^ v E[\W{v L ) n R(v R )\] + ^ E[\W{v L ) U R(v R )\] = 
lg -ji ■ sM). The first term is at most T, as explained above. In the second term is also bounded by 
T. This is because J2 V E[| W(v.z,) U R(v R )\] <Tlgjj since every cell probe is counted once for every 
ancestor of the time it executes. Thus 2T > f2(4ylg-||° • eM) = Q,{e 2 n\gn). In our construction, the 
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total running time was T = 0(nt u + jjCt q ). Since t u = o(e 2 lgn), the second term must dominate: 
^-t q = n{e 2 n\gn), so t q > M/C = n 1 " 26 = n 1 ' ^. 

Proof of Lemma [HJ We will prove the stronger statement that the lower bound holds no matter how we 
fix the updates outside node vl- 

We transform the problem into the natural communication game: Alice receives the update permutations 
in the subtree vl and Bob receives the colorings of the queries in the subtree vr (the updates are fixed). They 
have to check whether all queries are positive in the sequence of Update and Query operations defined by 
their joint input. 

We apply Lemma [4] to construct a nondeterministic communication protocol for this problem, with 
complexity E[|W(«l) n R(v R )\ ■ O(lgn) + 0(\W(v L ) U R(v R )\)]. The conclusion of Lemma [ID follows 
by comparing this protocol to the following communication lower bound: 

Lemma 12. The game above has nondeterministic (average-case) communication complexity Q(kAl lg C). 

Proof. Let Xa and Xb be the inputs of the two players. For any choice of Xa, there is a unique sequence 
of colorings Xb that Bob should accept. As in the proof of Lemma |9j we conclude that the public proof is 
an encoding of Xb so we can lower bound the complexity via H(Xb). 

Let J a and Jb be the columns touched (updated and queried) in Alice's input and in Bob's input. Bob's 
input consists of the coloring of column j, for each j G Jb- This is 7r<j applied to the fixed coloring in the 
first column. 

Since Ja and Jb are defined by the bit-reversal permutation, we know that they interleave perfectly: 
between every two values in the sorted order of Jb, there is a unique value in Ja- Thus, the coloring for 
different j G Jb are independent random variables, since an independent uniform permutation from Ja 
is composed into 7r<j compared to all indices from Jb below j. Each coloring is uniformly distributed 
among balanced colorings, so it has entropy M lg C — 0(C lg M) (c.f. proof of Claim©. We conclude that 
H(X B ) = VL{kM\gC). □ 

5 Amortized link-find bounds 

In this section we consider the amortized complexity of the link-find problem which is like the union-find 
problem except that we can link arbitrary nodes, not just roots. In link-find, we may not necessarily have an 
obvious notion of a root that we can find. The fundamental requirement to a component is that if we call find 
from any vertex in it, we get the same root as long as the component is not linked with other components. 

Let u be the number of updates and q the number of queries. With union-find, the complexity over the 
whole sequence is @(a(q,u)q) if q > u, and @(a(q,q)q + u) if q < u. With link-find, we get the same 
complexity when q > u, but a higher complexity of @(a(q, u)u) when q < u. Thus, with link-find, we get 
a symmetric formula in q and u of 

0(a(max{g, u}, min{g, u}) m&x{q, u}). (1) 

We get the upper-bound in © via a very simple reduction to union-find. 

5.1 The link-find data structure 

Nodes have three types: free, leaf, and union nodes. A leaf node has a pointer to a neighboring union node, 
and the union nodes will participate in a standard union-find data structure. The parent of a leaf is the union 
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node it points to. The parent of a union node is as in the union-find structure and the parent of a root is the 
root itself. 

All nodes start as free nodes. We preserve the invariant that if a component has a free node, then all 
nodes in the component are free. 

To perform a find on a free node v, we scan the component of v. If it is a singleton, we just return it. 
Otherwise, assuming some initial tie-breaking order, we make the smallest node in the component a union 
node and all other nodes leaf nodes pointing to is. The union node which is its own root is returned. All this 
is paid for by the nodes that lost their freedom. 

To perform a find on a non-free node, we perform it on the parent which is in the union-find data 
structure. 

We now consider the different types of links. When we perform link between two free nodes, nothing 
happens except that an edge is added in constant time. 

If we link a free node v with a non-free node w, we make all nodes in the components of v leaves 
pointing to the parent of w. This is paid for by the new leaves. 

If we link two non-free nodes, we first perform a find from their parents which are union nodes. If they 
have different roots we unite them. 

This completes the description of our link-find data structure which spends linear time reducing to a 
union-find data structure. A union node requires a find on a non-singleton node, so the number of union 
nodes is at most min{q, u}. Concerning finds in the union-find data structure, we get one for each original 
find on a non-free node. In addition, we get two finds for each link of two non-free nodes, adding up to at 
most q + 2u finds. Our total complexity is therefore 

0(u + q + a(q + 2u, min{g, u})(q + 2u)) = 0(a(max{q, u}, min{g, u}) max{q, u}). 

We are going to present a matching lower bound. 

5.2 The link-find data structure for a forest 

We will now show that it is the links between nodes in the same components that makes link-find harder 
than union-find in the sense that if no such links appear, we get the same O-bound as with union-find. 

The modification to the above link-find reduction is simple. Using standard doubling ideas, we can 
assume that u and q are known in advance. If q > u, we are already matching the union-find bound, so 
assume q < u. 

To do a find on a free node, we again scan its component. However, if it has less than a(q, q) nodes, we 
just return the smallest but leaving the component free. Otherwise, as before, we make the smallest node a 
union node and all other nodes leaf nodes pointing to it. This is the only change to our link-find algorithm. 

In the case where the component has a(q,q) nodes, we clearly pay only 0(a(q, q)) for a find. The 
advantage is that we now create at most u/a(q,q) union nodes. Links involving a free node have linear 
total cost, and now, when we perform a link of non-free nodes, we know they are from different components 
to be united, so this will reduce the number of union roots by one. Hence we get at most 2u/a(q, q) finds 
resulting from these links. Thus, in the union-find data structure, we end up with q + 2u/a(q, q) finds and 
u/a(q, q) unions. The total cost is 

0(u + q + a(q + 2u/a(q, q),u/a(q, q))(q + 2u/a(q, q)) = 0(a(q, q)q + n) 

time. The simplification uses that a is increasing in its first and decreasing in its second argument, and that 
the whole time bound is linear if q < u/a(q, q). 
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Appendix a. Lower Bounds for Amortized Link-Find 



We will now sketch a proof for the lower-bound in ([T) with u link updates and q find queries. When q > u, 
we get this from the union-find lower bound of Q(a(q, u)q) from [FS89]. However, for g « «, we need to 
prove a higher lower-bound than that for union-find. The lower bound we want in this case is Q(a(u, q)u). 

We would get the desired lower bound if we could code a union-find problem with Q(q) updates and 
Q(u) queries. We cannot make such a black-box reduction, but we can do it inside the proof construction 
from [FS89]. We will only present the idea in the "reduction". For a real proof one has to carefully examine 
the whole proof from [FS89] to verify that nothing really breaks. 

The lower bound construction from [FS89 1 proceeds in rounds. We start with singleton roots. In a union 
round, we pair all current roots randomly, thus halving the number of roots. In a find round, we perform a 
number of finds on random leaves. The number of finds are adjusted depending on the actions of the data 
structure. From [KST02a | we know that the lower bound also holds if the finds just have to verify the current 
root of a node. 

In our case, we will start with n roots. In a union-round, we just link roots as in union-find. However, in 
a find round, instead of calling find from a leaf v, we link v to its current root r. We want to turn this leaf -root 
link into a verification. We will not do that for the individual links, but we will do it for the find-round as a 
whole (one needs to verify that this batching preserves the lower-bound). At the end of the find-round, we 
simply perform a find on each root. All these finds should return the root itself. If one of the links (v, r) 
had gone to the wrong root and r' was the correct root, then r and r' would be connected in the same tree, 
which means that they cannot both be roots. One of the finds would therefore return a different root. If 
the union-find problem we code used / finds, then our link-find solution ends up with u = n — 1 + / link 
updates and q = n — 1 find verifications, hence with the desired lower bound of 

ft(a>{f,n)f) = n(a(u,q)u). 
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CNJ ' Abstract 

We present a new threshold phenomenon in data structure lower bounds where slightly reduced 
update times lead to exploding query times. Consider incremental connectivity, letting % be the time to 
insert an edge and t q be the query time. For t\j = Q(tq), the problem is equivalent to the well-understood 
t^. . union-find problem: InsertEdge(s, t) can be implemented by UNION (FIND (s), FlND(t)). This gives 

' worst-case time t\j = t q = 0(\gn/ lglgn) and amortized tu = t q = 0(a(n)). 

By contrast, we show that if % = o(lg n/ lg lgn), the query time explodes to t g > n 1-0 ' 1 ). In other 
| words, if the data structure doesn't have time to find the roots of each disjoint set (tree) during edge 

^— ^ . insertion, there is no effective way to organize the information! 

For amortized complexity, we demonstrate a new inverse- Ackermann type trade-off in the regime 

Q ■ tv = 0(t q ). 

!— 1 ' A similar lower bound is given for fully dynamic connectivity, where an update time of o(lg n) forces 

. the query time to be n 1 ^ ^. This lower bound allows for amortization and Las Vegas randomization, 

and comes close to the known C>(lg n ■ (lg lg n)°^ ) upper bound. 

m ' 

!>■ ! 1 Introduction 

CN . We present a new threshold phenomenon in data structure lower bounds where slightly reduced update times 

lead to exploding query times. Previous trade-offs where smooth and much weaker. The new explosive lower 

^_ ■ bounds are found hidden in some very well-studied problems: incremental and fully-dynamic connectivity. 

For incremental connectivity, the explosion is only in the worst-case. For the fully-dynamic case we also 

• »~j | get an explosion in the amortized bounds. 

rS ■ 

1.1 Incremental connectivity and union-find 

The incremental connectivity problems considers a graph G with vertex set {1, ...,n}. Starting with no 
edges in G, we support the following operations: 

Connected (u, v) : Tells if u and v are connected in G. 
InsertEdge(u, v) : Adds the edge (it, v) to G. 

The obvious way to implement incremental connectivity is the classic union-find data structure which 
supports the following operations on a collection of disjoint sets, starting from n singleton sets {1}, . . . , {n}: 

Find(u) : Return a "root" representing the set containing v. 

Union(u, v) : Both u and v should be roots of sets, i.e. Find(u) = u and Find(v) = v. The operation 
creates the union of the sets and returns its new root. 
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The terminology for the union-find problem stems from the usual implementation as a forest of rooted 
trees where each tree spans a set. Find involves walking up to the root of v's tree, potentially doing useful 
work like path compression on the way. UNION simply adds an edge between the roots whose direction is 
usually determined by the size or rank of sets. 

Given a union-find data structure, we can implement incremental connectivity as: 

Connected (u, v) : IsFind(«) =Find(i>)? 
InsertEdge(u, v) : Union (FiND(it), Find (v)). 

For now our focus is on worst-case bounds, but shall return to inverse-Ackermann style amortized 
bounds in Section [5] From a worst-case perspective, the classic union-by-rank gives union in constant 
time and find in 0(log n) time. Trade-offs were addressed by Blum [6 ], with an improvement by Smid lfl4l . 
They show that, if the time for union is bounded by t\j, Find can be supported in worst-case 0(lg nj lg t\j). 
This trade-off is known to be optimal in the powerful cell-probe model 1 1 ] (see below for a review of the 
lower bounds). 

Implementing incremental connectivity with union-find, we do not benefit if Union is faster than Find. 
The natural solution is to balance the times, supporting all operations in 0(log nj log log n) time. But do 
we really need to take time to find the roots before entering the union? What happens if we try to reduce 
the update time, e.g., can we as in the plain union-find problem do updates in constant time and queries in 
logarithmic time? The answer is a resounding no: any improvement in the 0(log nj log log n) update time 
for insertions will make the connectivity query time explode. 

Theorem 1. Any data structure for incremental connectivity over n vertices that supports edge insertions 
in worst-case time o{ ^Q n ) must have worst-case connectivity query time n 1 ^ ^ in the cell-probe model 
with cells ofO(lg n) bits. 

1.2 Combining union with general links 

To describe the full power of our lower bound, we will augment union-find with a natural link operation. To 
formally define the problem, we need a convention on the representatives in union-find. When we start, all 
elements are singletons, each being its own representative. Now define: 

Link(u, v) : Here u and v are arbitrary elements of sets. If these sets are different (Find (it) ^ Find(u )), 
the operation creates a new set that is the union of the two old sets, whose representative is the old 
Find(u). The two old sets are destroyed. Thus, if r = Find (it) before the call, then afterwards 
Find(u) = r = Find(u). 

Union(u, v) : This operation is a special case of Link(u, v), where the caller of the function makes a 
guarantee that before the call, both u and v are the representatives of their own sets, i.e., Find(u) = u 
and Find(-u) = v. Afterwards, Find (it) = u = Find(-u) 

This semantics of choosing representatives can be supported by any union-find data structure with a 
constant overhead, as one can maintain a translation table between the representatives chosen by the data 
structure and the ones mandated in our definition. Now Link is a direct generalization of Union to the 
case where the arguments may not be representatives. It has the obvious implementation Link(u, v) = 
Union(Find(u),Find(?;)), with the running time i L < *u + 2i F- 

If we choose to support Union in time t\j, the standard union-find data structures can support Find in 
i F = O(y^) time. This immediately implies a running time for Link of = 0(t\j + tp) = 0(t\j + 
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Our basic question is whether Link "requires" a Find operation, i.e. whether one can support ti, = 
o(tp). This question only arises in the regime t\j = o( ^™ n ), as otherwise £tj dominates the running time 
of £l (which is necessary, as UNION is a special case of Link by definition). As a particular example, if we 
support Union in constant time, both Link and Find take O(lgn) time by known results. An intriguing 
question is: with Union in constant time, can we implement links in sublogarifhmic time while preserving 
efficient (say, polylogarithmic) query time? 

To prove the strongest lower bounds, we prefer to work with a weaker query than Find: 

Connected (u, v) : Are u and v in the same set, i.e. Find (it) = Find(u)? 

The link-connected problem is identical to incremental connectivity. Lower bounds for CONNECTED 
queries immediately translate into lower bounds for Find queries. From now on, we consider the mixed 
union-link-connected problem (whose update/query trade-offs turn out to be identical to union-link-find). 
We prove that if Link does not have enough time to run a Find query (with the classic running times), the 
data structure cannot organize information effectively and the connectivity query time explodes to essentially 
linear time: 

Theorem 2. Any data structure supporting UNION in t\j = o( ^™ n ) worst-case time and Link in = 

°( iog"u ) worst-case time, must have worst-case CONNECTED (and Find) query time £q > n l ^°^ in the 
cell-probe model with cells ofOilgn) bits. 

With union and link running in o(log n/ log log n) time, the theorem reproves the query lower bound of 
Theorem Q] 

A question of similar flavor appeared in connection with union-find with deletions. Links and deletions 
have the common issue that they apply to arbitrary elements, hence that we do not a priori know what tree 
we are in. Kaplan et al. iflOl considered the union-delete-find problem but wanted local bounds were n is 
the size of the actual trees involved in an operation. All the above mentioned worst-case bounds are local, 
but this would be lost if we tried implementing deletions with global rebuilding. Kaplan et al. flOl showed 
how to augment union-find with a delete operation if we when deleting an element x, first find the root and 
then perform a local rebuilding step in the tree that x is deleted from. With union in constant time, they 
implemented both find and delete in O(logn) time. Similar to our case, they asked if the deletion time 
could be improved while preserving the logarithmic query time. In the case of deletions, the answer was the 
strongest possible "yes." Alstrup et al. [2 1 proved that both unions and deletions could be supported locally 
in constant time while preserving the local logarithmic query time. 

1.3 Fully-dynamic connectivity 

We show a similar computational phenomenon for fully dynamic connectivity with both edge insertions and 
deletions. In this fully-dynamic case, we hit the wall even if amortization is allowed. 

Theorem 3. Any data structure for fully dynamic connectivity in a graph ofn vertices with update time t u = 
o(lgn) must have query time t q > n 1 ^"^. This bound allows amortization and Las Vegas randomization 
(expected running times), and holds in the cell-probe model with cells ofO(lgn) bits. 

Thorup |[T8l has an almost matching upper bound of t\j = 0(lgn • (lglgra) 3 ) and t q = o(lgra). This 
data structure uses both Las Vegas randomization and amortization. 



3 



1.4 Lower Bounds 



Many of the early lower bounds for union-find were in (restricted versions of) the pointer machine model 

In STOC'89, Fredman and Saks (H were the first to show dynamic lower bounds in the cell-probe 
model. They studied the partial sums problem and the union-find problem. The partial sums problem asks 
to maintain an array A[l . . n] under pointwise updates and queries for a prefix sum: Yli<k ^H- ^ or P art ial 
sums and for worst-case union-find, Fredman and Saks showed a lower bound of t q = Q[\gn/\g(t\j lg n)). 
For amortized union-find, they gave an optimal inverse-Ackermann lower bound. A different proof of the 
same bounds was given by Ben-Amram and Galil in FOCS'91 JH. 

In STOC'99, Alstrup, Ben-Amram and Rauhe [1] improved the trade-off for union-find to t q = 
Q(lgn/ lgtu)> which was also the highest known trade-off for any problem. In STOC'02, Kaplan, Shafrir 
and Tarjan [9] showed that the optimal worst-case and amortized trade-offs for union-find also hold for a 
weaker Boolean version where the user specifies set identifiers and where we only have membership queries. 
From a lower bound perspective, the tricky part is that the query output is a single bit. Identifiers can always 
be viewed as special elements of sets. Thus they get the same lower bound trade-off for incremental connec- 
tivity: edges are only added between current set identifiers, and connectivity queries are between arbitrary 
nodes and current set identifiers. This lower-bound trade-off for incremental connectivity is tight when 
t\j = Q(t q ), matching the previously mentioned upper-bounds for link-find. However, by our Theorem Q] 
the incremental connectivity queries hit a wall when the update time becomes lower. 

The work of Patra§cu and Demaine from STOC'04 [13] gives the best trade-offs known today, for 
any explicit problem. They considered partial sums and fully dynamic connectivity, and showed that, 
if m&x{tu,t q } = O(Blgn), then min{tu 5 ^} = ^(log B n). In particular, their bounds implied 
max{tu, tq} = S^(lg n), whereas previous results implied max{t\j, t q } = ft(lg n/ lg lg n). 

These bounds are easily seen to be optimal for the partial sums problem. The standard solution is to 
create an ordered binary tree with leaf set [re] ; each internal node maintains the sum of its children. Updates 
and queries are trivially supported in 0(logn) time. To get a trade-offs, we can instead use a .B-tree with 
degree B. The time of an update is the height of the tree, which is Oi\og B re). However, to answer a query, 
we need to add up all left siblings from the path to the root, so the query time is 0{B \og B n). 

Our results significantly improve the known trade-offs in the regime of fast query times. Note that the 
previous strongest bounds from [13 ] could at most imply t q = Q(n £ ) even for constant update time. Here e 
depends on the constant in the update time. For example, allowing only 4 cell probes for the updates, lfl3l 
careful inspection] gets a query lower bound of Q(n^ ). Our Theorem [3] says for another problem that we 
with o(logre) probes get a query lower bound > re 1-0 ^ 1 ) queries. 

The trade-offs of lfl~3l are optimal in the full range for the partial sums problem. For incremental and 
fully dynamic connectivity, the previous mild trade-offs (3 [13] are optimal in the regime t\j » t q ; it is only 
the regime of fast updates that causes the abrupt transitions in Theorems [T] and [3] 

Lower bounds beyond the balanced tree The previous lower-bounds we discussed are essentially all 
showing that the we cannot do much better than maintaining information in a balanced tree. All operations 
follow well-understood paths to the roots. Trade-offs were obtained by increasing the degree, decreasing the 
height: the faster of updates and queries would just follow the path to the root while the slower would have 
to consider siblings on the way. The lower bounds from [13] are best possible in this regard. 

Our stronger trade-offs for incremental and fully-dynamic connectivity shows that there is no such sim- 
ple way of organizing information; that the links between arbitrary vertices changes the structure too much 
if the update times is not long enough, we cannot maintain the balanced information tree. 
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2 Simulations by Communication Games 



Generally, for the data structure problems considered, we are going to find an input distribution that will 
make any deterministic algorithm perform badly on the average. This also implies expected lower bounds 
for randomized algorithms. 

Consider an abstract dynamic problem with operations Update(uj) and QiJERY((/j). Assume the se- 
quence of operations is of fixed length, and that the type of each operation (query versus update) is fixed 
a priori. The "input" u-i or of the operation is not fixed yet. Let I a and Ib be two adjacent intervals of 
operations, and assume that every input ui or qi outside of I a U Ib has been fixed. What remains free are the 
inputs Xa during interval I a and Xb during interval Ib- These inputs (Xa, Xb) follow a given distribution 
V. 

It is natural to convert this setting into a communication game between two players: Alice receives Xa, 
Bob receives Xb, and their goal is to answer the queries in Xb (which depend on the updates in Xa)- In 
our applications below, the queries will be Boolean, and it will even be hard for the players to compute the 
and of all queries in the Ib interval. Each player is deterministic, and the two players can exchange bits of 
information. The last bit communicated should be the final answer of the game, which here is the and of 
the queries in Ib- The complexity of the game is defined as the total communication (in bits) between the 
players, in expectation over V. 

We will work in the cell-probe model with w-bit cells; in the applications below, w will be ©(lgn). Let 
Wa be the sets of cells written / read during time interval I a- We consider all cells touched by the algorithm 
during interval Ib in order. If a cell is read before being written, we include it in the set Rb', if a cell is first 
written, include it in the set Wb- 

Lemma 4. For any p > 0, the communication game can be solved by a zero-error protocol with complexity 
~E>v[\Wa\ ■ 0(lg |) + 0(w) ■ (\Rb n (Wa \ Wb)\ + p\Rb\)]- 

Proof. Alice first simulates the data structure on the interval I a- The memory state at the beginning of I a 
is fixed. After this simulation Alice constructs a Bloom filter Q with error (false positive) probability p for 
the cells Wa- The hash functions needed by the Bloom filter can be chosen by public coins, which can later 
be fixed since we are working under a distribution. Alice's first message is the Bloom filter, which requires 
|W A |-0(lgi)bits. 

Bob will now attempt to simulate the data structure on Ib- The algorithm may try to read a cell of the 
following types: 

• a cell previously written during Ib- Bob already knows its contents. 

• a cell that is positive in the Bloom filter: Bob sends the address of the cell to Alice, who replies with 
its contents; this exchange takes 0(w) bits. 

• a cell that is negative in the Bloom filter: Bob knows for sure that the cell was not written during I a- 
Thus, he knows its contents, since it comes from the old fixed memory snapshot before the beginning 
Of/A- 

With this simulation, Bob knows all the his answers and can transmit the final bit telling if they are all true. 
The number of messages from Bob is \Rb fl (Wa \ Wb) \ (true positives) plus an expected number of false 
positives of at most p \ Rb \ ■ □ 

We will use the simulation to obtain lower bounds for \Wa H Rb\, comparing the complexity of the 
protocol with a communication lower bound. This simulation works well when \Wa H Rb\ ~ \Wa U 
i?e|/ igfgn ' since we can use p « j-^, and make the term \Wa H Rb \ dominate. Unfortunately, it does not 
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work in the regime \Wa n Rb\ ~ |Wa U i?s|/lgn, since one of the terms proportional to \Wa\ or \Rb\ 
will dominate, for any p. 

To give a tighter simulation, we use a stronger communication model: nondeterministic complexity. In 
this model, a prover sends a public proof Z to both Alice and Bob. Alice and Bob independently decide 
whether to accept the message, and they can only accept if the output of the communication game is "true" 
(i.e. all queries in Ib return true). In this model Alice and Bob do not communicate with each other. Alice's 
answer is a deterministic function /a{Xa, Z) of her own input and the public proof. Similarly, we have 
Bob's answer /b{Xb, Z). For the protocol to be correct, Ja(Xa, Z) and Jb(Xb, Z) may only both be true 
if this is the answer to the game. 

Our goal for the prover is to define a short public proof Z(Xa, Xb) that will lead Alice and Bob to the 
desired answer /a(Xa, Z(Xa, Xb)) A Jb{Xb, Z(Xa, Xb))- The complexity of the protocol is the of the 
game should be the and of all queries in Ib- Since we are working under a distribution, the bit length of 
the prover's message Z(Xa, Xb) is a random variable, and we define the complexity of the protocol as its 
expectation. 

Lemma 5. The communication game can be solved by a nondeterministic protocol with complexity 
E v [0(w)-\W A nR B \ + 0{\WAURB\)}. 

Proof. We will use a retrieval dictionary (a.k.a. a Bloomier filter, or a dictionary without membership). 
Such a dictionary must store a set S from universe U with k bits of associated data per element of S. When 
queried for some x G S, the dictionary must retrieve x's associated data. When queried about x £ S, it may 
return anything. One can construct retrieval dictionaries with space 0(k\S\ + lglg |?7|); see e.g. 171 . 

The message Z(Xa,Xb) of the prover will consist of the addresses and contents of the cells X = 
\Wa H Rb\, taking 0[w) bits each. In addition, he will provide a retrieval dictionary for the symmetric 
difference WaARb = (Wa \ Rb) U {Rb \ Wa)- In this dictionary, every element has one associated bit of 
data: zero if the cell is from Wa \Rb and one if from Rb\ Wa ■ The dictionary takes O (lg w + 1 Wa URb\) 
bits. 

Alice first simulates the data structure on I a- Then she verifies that all cells X were actually written 
(X C Wa), and their content is correct. Furthermore, she verifies that for all cells from Wa\X, the retrieval 
dictionary returns zero. If some of this fails, she rejects with a false. 

Bob simulates the data structure on Ib- The algorithm may read cells of the following types: 

• cells previously written during Ib- Bob knows their contents. 

• cells from X: Bob uses the contents from public proof (Alice verified these contents). 

• cells for which the retrieval dictionary returns one: Bob uses the contents from the fixed memory 
snapshot before the beginning of I a (Alice verified she didn't write such cells). 

• cells for which the retrieval dictionary return zero: Bob rejects. The prover is trying to cheat, since in 
a correct simulation all cells of Rb \ X has a one bit in the dictionary. 

If neither player rejects, we know that Rb \ X is disjoint from Wa \ X, so the simulation of Bob is correct. 
Finally Bob rejects if any of his answers are false. □ 

3 Lower Bound for Union-Link-Find and Incremental Connectivity 

This section proves Theorem 
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Any data structure supporting UNION in ty = o( ^™ n ) worst-case time and LINK in ty = 

°( log"u ) worst ~ case time, must have worst-case CONNECTED query time £q > n l ~°^ in the 
cell-probe model with cells ofw = 0(lg n) bits. 



With hindsight, define: 
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Finally, we define two parameters C = n £ and M = n e . 

The starting point of our hard instance is essentially taken from Fredman and Saks' seminal paper J8]. 
The hard instance will randomly construct a forest of M trees. Each tree will be a perfect tree of degree B 
and height log B (n/M). On layer of the forest we have the M roots. On layer i, we have exactly M ■ B l 
vertices with B l vertices from each tree. 

We can describe the edges between level i and i — 1 as a function /j : [M ■ B l ] — > [M ■ B 1 ^ 1 ] that 
is balanced: for each x € [MB ], \(fi) (x)\ = B. We will use the following convenient notation for 
composition: />j = /;o f i+1 o • • • . For example, the ancestor on level i — 1 of leaf x is f>i(x). 

Our hard instance will insert the edges describing /j's in bottom-up fashion (i.e. by decreasing i, from 
the largest level up to the roots). We call "epoch i" the period of time when the edges /j are inserted. Let 
Wi (respectively R{) be the cells written (respectively, read) in epoch i. All the above edges are added in 
union-find style from roots of current trees, i.e. the hard instance only runs UNION operations for this part 
of the construction. More precisely, before epoch i the future roots . on level i — 1 are singletons, and 
for each level i node Vij, we execute UNiON(u i _ 1 j.( J -), Vij). It follows that \Wi\ + \Ri\ < M ■ B l t\j. We 
will use the following convenient notation for set union: W<i = U,-<i Wj. The cells W{ \ are those 
last written in epoch i. 

The above constitutes the hard case for union-find from JSJ. At this point [8] show that running Find 
on a random leaf requires reading cells from most epochs, hence forcing the expected time of Find to be 



Our goal is to show that linking arbitrary vertices may lead to much more expensive queries (even if we 
only allow Boolean connectivity queries). We will describe some very powerful metaqueries that combines 
links to roots and leaves with a few connectivity queries to reveal far more information than if we only had 
the regular connectivity queries. The metaqueries will be provably hard to answer, so if the links are done 
too quickly, the queries must be very slow. 

Our graph contains C additional special vertices a±,...,ac, conceptually colored with the colors 1 . . C. 
Each colored vertex aj is now linked to M/C nodes on level (the final roots of our trees). This is done in 
a fixed pattern: colored vertex 1 is connected to roots 1, . . . , M/C; colored vertex 2 to the next M/C roots; 
etc. 

We say the root color of a vertex is the color that its root is connected to. Conceptually, the hard 
distribution colors a random set Q of exactly M leaves and verifies that these are the root colors. To 



n(lgn/IgS). 
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implement this test, we run Link between each query leaf and the proposed colored vertex. Then, for 
i = 2 . . C, we run a Connected query asking whether colored vertex i is connected to colored vertex 
i — 1, followed by inserting an edge between these two color vertices. The metaquery returns "true" iff all 
connectivity queries return negative answers. 

We claim that if the metaquery answers true, the coloring of Q must be consistent with the coloring 
of the roots. Indeed, if some leaf is colored i and its root is colored j ^ i, this inconsistency is caught at 
step max{i, j}. At this step, everything with color < max{i, j} — 1 has been connected into a tree, so the 
connectivity query will return true. 

Let x(Q) be the coloring of leaves in Q that matches their root colors. In the hard distribution, the 
metaquery always receives proposed colors from so it should answer true. Nevertheless, the data 

structure will need to do a lot of work to verify this. Let BP be the cells read during the metaquery. We 
have \E9\ < C ■ tq + 2M ■ t L . The main claim of our proof is: 

Lemma 6. For any i £ {1, . , . , \og B (n/M)}, we have E[\R Q n (Wi \ W<j)|] = Q,(eM). 

Before we prove the lemma, we show how it implies our lower bound. The sets Wi \ W<i are disjoint 
by construction (they contain the cells last written in epoch i), so E[|i?^|] > Yli E[|i?^ n (Wi \ W<j)|]. 
Remember that we have \og B (n/M) = 0(log B (n £ ) / Ig B) = 0{e\og B n) epochs. Thus E[|i?^|] = 
Q(M ■ e 2 log B n). We compare this to the worst-case upper bound \B9\ < C • tq + 2M ■ ti,. By assumption 
of Theorem|2l £l = )■ By © we have tL < £ 3 log^ n = o(e 2 log B n), ensuring that the second term 

of the upper bound is negligible. It follows that C-tq = Q(M-e 2 log B n), hence tq = Q(e 2 M/ C log B n) = 

Proof of Lemma|6] Fix i. We will prove the stronger statement that the lower bound holds no matter how 
we fix the edges outside epoch i (all fj's for j ^ i). 

To dominate the work of later epochs i — 1, . . . , 1, we consider B l i.i.d. metaqueries. Choose sets 
Q 1 , Q 2 , . . . , Q B ' independently, each containing M uniformly chosen leaves. Starting from the memory 
state where all trees are completely built and the roots have been colored, we simulate each metaquery 
(Q- 7 , x(Q- 7 )) in isolation. We do not need to write any cells in this simulation, for the cell-probe model has 
unbounded state to remember intermediate results and in our hard distribution there is no operation after the 
metaquery. Thus the simulations of the different metaqueries do not influence each other. Let R* be the 
cells read by all B l metaqueries. By linearity of expectation, 

E[\R* n (Wi \ W<i)|] < B { • E[\R Q n (Wi \ W<i)|]. (3) 

Let Q* = (J • Qi . Since we have fixed all />j, asking about the root color of a leaf q G Q* is equivalent 
to asking about the root color of node f>i(q) on level i. 

Claim 7. We have E[|/>i(Q*)|] > (1 - \)MB\ 

Proof. Each leaf x in some Qi is chosen uniformly, so its ancestor f<i(x) is also uniform. The M ■ B % trials 
are independent (for different Qi , Q k ), or positively correlated (inside the same Q J , since the leaves must 
be distinct). Thus, we expect to collect (1 — l/e)MB l distinct ancestors. □ 

By the Markov bound |/>i(Q*)| > \MB % with probability at least 1 — 2/e. Thus we may fix the 
sequence (Q 1 , Q 2 , . . . , Q B *) to a value that achieves |/>i(Q*)| > \MB i while increasing E[|i?* n (Wj \ 
W<i)|] by at most (1 - 2/e)- 1 = O(l). 
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The only remaining randomness in our instance are the edges /, from epoch i and the proposed colorings 
x(Q J ) given to each metaquery Qf To be valid, these colorings are functions of /j, for as soon as we know 
fi, we know the whole forest including the root colors of all the leaves in the different Qf The metaquery 
colors have to agree on common leaves, so they provide us a coloring x(Q*)- With /j yet unknown, we 
claim that x(Q*) has a lot of entropy: 

Claim 8. H(x(Q*)) = ft(M-BMgC). 

Proof. Let X be the unknown coloring of all vertices on level i. We claim it has entropy H(X) = MB 1 ■ 
log 2 C — 0(C lg n). We have not fixed anything impacting this coloring so X is a random balanced vector 
from [C] MB * . Indeed, any balanced coloring is equiprobable, because the coloring of the roots is balanced, 
all trees have the same sizes, and f is a random balanced function. We claim that it has entropy H(X) = 
MB 1 • log 2 C — 0(Clgn). The number of balanced colorings is given by the multinomial coefficient 
KMByc MB i /c )■ "'"his is the central multinomial coefficient, so it is the largest. It must therefore be at 
least a fraction (MB l )~ c > rT c of the sum of all multinomial coefficients. This sum is C MB% (the total 
number of possible colorings), so H(X) > log 2 (C MS */n) = MB 1 log 2 C — Clog 2 n. 

We argue that H(x(Q*)) = fl(MB l lgC). Indeed, x(Q*) reveals the coloring of vertices f<i(Q*) on 
level i, which number at least ^MB l . Given x(Q*)> to encoding X, we just write all other colors explicitly 
using iATB* log 2 C bits. Therefore H(x(Q*)) > H(X) - \MB l \og 2 C > MB i \og 2 C - C\g 2 n - 
\MB i \og 2 C = n{MB i \gC). ' □ 

We consider the communication game in which Alice represents the time of epoch i (her private input 
is Xa = fi), and Bob represents the time of epochs i — 1, . . . , 1 and the metaqueries (his private input is 
Xb = x{Q*))- Their goal is to determine whether all the metaqueries return true. 

Claim 9. Any zero-error protocol must have average case bit complexity Vt(MB l lg C). 

Proof. We turn our attention to the communication game. The set of inputs of Alice and Bob that lead to a 
fixed transcript of the communication protocol forms a combinatorial rectangle. More precisely, a transcript 
t represents a sequence of transmissions between Alice and Bob. On Alice's side, there will be a certain set 
X l A of inputs making her follow t provided that Bob follows t, and we have a corresponding input set X B 
from Bob. Inputs Xa and Xb will lead to t if and only if (Xa, Xb) € X\ x X B . Since the players must 
verify Xb = x{Q*) and the protocol has zero error, the rectangle cannot contain two inputs of Bob with 
different x(Q*)> that is, \X B \ = 1 for all valid t. Thus H(t) > H(x(Q*). □ 



We will use Lemma|4]to obtain a communication protocol, setting the rate of false positives in the Bloom 

;r to p = —. 

lg- 

:i U R* whe 
complexity is: 



filter to p = jgT^- The cells written in Alice's interval are precisely Wi, the cells read in Bob's interval are 
i?<i U R* where R* is the union of the cells read by all the metaqueries. By Lemma HI the communication 



E[|( J R <i Ui?*)n(W i \Ty <i )|-OH 
+ |Wi|-0(lgi) + p\R <t UR*\-0(w)] 

< E[\R* n (Wi \ W«)| + \R<i\] • 0(w) 

+ 0(|Wi|lglgn) + \R*\-0(pw) 

< E[|fl*n(Wi\W<i)|] • 0(w) + 0(MB i ~ 1 t v w) 

+ 0(MB i ■ t v lg lg n) + 0(MB l ■ ffi) (4) 
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Figure 1: The shape of our graphs. 

We compare this to the lower bound of £l(MB' L lg C) = Q.(MB % - sign) from Claim[9] We will now argue 
all the terms of dD except the first are asymptotically negligible compared to the lower bound. Inspecting 
the terms in order: 

• Since e = o(l), we have B = t\j S = uj(t\j/e), implying MB 1 ' 1 ■ t\jw = o(MB t ■ elgn), 

• By©, 

■ It? 77 

MB 1 ■ t v lg lg n < MB 1 — lg lg n = o(MB l • e lgn) 

lg lg n 

• the last term is always o(MB' L ), since = o(lgn) and w = O(lgn). 

As these terms of (O are dominated by the lower bound, we obtain ~E[\R* n (Wi \ W<i)|] = VL{eMB % ). 
From ®, linearity of expectation implies B[\R Q n (W f \ W<i)|] > E[|i2* n (Wj \ W <i )\]/B i , so we have 
completed the proof of Lemma|6]by showing E[|i?^ PI (Wj \ W<») |] = O(eM). As shown earlier, Lemma|6] 
implies the desired lower bound of Theorem |2] 

4 Lower Bound for Dynamic Connectivity 

Theorem 10. Any data structure for dynamic connectivity in graphs of n vertices that has (amortized) 
update time iij = o(lg n) must have (amortized) query time t q > n 1 ^"^. 

Let e be such that t\j = o(e 2 lgn), and define M = n 1_e and C = n £ . The shape of our graphs is 
depicted in Figure [T] The vertices are points of a grid [M] x [n/M]. The edges of our graph are matchings 
between consecutive columns. Let tt\, . . . , TT n /M-i be the permutations that describe these matchings. We 
let 7r<j = TTj o 7Tj_i o ■ ■ ■ o 7ri. Node i in the first column is connected in column j + 1 to 7r<j-(i). 

The graph also contains C special vertices, which we imagine are colored with the colors 1, . . . , C. At 
all times, a colored vertex is connected to a fixed set of M / C vertices in the first column. (For concreteness, 
colored vertex 1 is connected to vertices 1, . . . , M/C; colored vertex 2 to the next M/C vertices; etc.) 

We will allow two meta-operations on this graph: Update and Query. Initially, all permutations are 
the identity (i.e. all edges are horizontal). Update (j, Tr new ) reconfigures the edges between columns j and 
j + 1: it sets TTj to the permutation n new . This entails deleting M edges and inserting M edges, so Update 
takes time 2M • t\j. 
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Query (j, x) receives a vector \ £ [C] M , which it treats as a proposed coloring for vertices on column 
j. The goal of the query is to test whether this coloring is consistent with the coloring of the vertices in the 
first column. More specifically, a node i of color a in the first column must have xt^j'W] = a. A Query 
can be implemented efficiently by connectivity operations. First each vertex i in column j is connected to 
the colored vertex x[A- Then, for i = 2 . . M, we run a connectivity query to test whether colored vertex i is 
connected to colored vertex i — 1. If so, Query return false. Otherwise, it inserts an edge between colored 
vertices % and i — 1 and moves to the next i. At the end, Query deletes all vertices it had inserted. The total 
cell-probe complexity of Query is O(M) ■ t\j + C ■ t q . It is easy to observe that this procedure correctly 
tells whether the colorings are consistent (as in our instance of incremental connectivity). 

We will now describe the hard distribution over problem instances. We assume jj — 1 is a power of 
two. Let a be the bit-reversal permutation on {0, . . . , -ft — 2}: a(i) is the reversal of i, treated as a vector 
of log 2 ( A - 1) bits. For i = 0, . . . , -A — 1, we execute an Update to position j = a(i) + 1, and a Query 
to the same position j. The update sets itj to a new random permutation. The query always receives the 
consistent coloring, and should answer true. The total running time is 

T < n/M(2Mtu + 0(M)t v + Ct q ) = 0(nt v + (n/M)Ct q ). 

If we can prove a lower bound T = cu(nt\j), then this will yield a high lower bound for t q . 

For the lower bound proof, we consider a perfect ordered binary tree with n/M — 1. The leaves are 
associated with the pairs of Update and Query operations in time order. Let W(v) (respectively R(v )) 
be the set of cells written (respectively, read) while executing the operations in the subtree of v. Note that 
W(v) C R{v ), since we have assumed a cell must be read before it is written. Our main claim is: 

Lemma 11. Let v be a node with 2k leaves in its subtree, and let vl, v r be its left and right children. Then 
E[\W(v L ) n R(v R )\ + j^\W(v L ) U R(v R )\] = n(k ■ eM). 

Before we prove the lemma, we use it to derive the desired lower bound. We claim that the total expected 
running time is T > J2 V H R(vr)\], where the sum is over all nodes in our lower bound tree. 

Consider how a fixed instance is executed by the data structure. We will charge each read operation to a 
node in the tree: the lowest common ancestor of the time when the instruction executes, and the time when 
the cell was last written. Thus, each W{vl) PI R(vr) corresponds to (at least) one read instruction, so there 
is no double-counting in the sum. 

We now sum the lower bound of Lemma ITT1 over all nodes; observe that J2 V k v = lg since 
the tree has n/M - 1 leaves. We obtain £*, E [I W K) n R(v R )\] + i^E^ E [I^K) U R(v R )\] = 
lg ■ eM). The first term is at most T, as explained above. In the second term is also bounded by 
T. This is because J2 V E[|W(«l) U R(v r )\] < Tig jj since every cell probe is counted once for every 
ancestor of the time it executes. Thus 2T > Sl(-Algjj • eM) = Q,{e 2 n\gn). In our construction, the 
total running time was T = 0(nt\j + jjCt q ). Since £tj = o(e 2 \gn), the second term must dominate: 
^-t q = n{e 2 n\gn), so t q > M/C = n x ~ 2e = n 1 "^. 

Proof of Lemma [U We will prove the stronger statement that the lower bound holds no matter how we 
fix the updates outside node vl- 

We transform the problem into the natural communication game: Alice receives the update permutations 
in the subtree vl and Bob receives the colorings of the queries in the subtree v R (the updates are fixed). They 
have to check whether all queries are positive in the sequence of Update and Query operations defined by 
their joint input. 
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We apply Lemma [5] to construct a nondeterministic communication protocol for this problem, with 
complexity E[|W(t> L ) n R(v R )\ ■ O(lgn) + 0(\W(v L ) U R(v R )\)}. The conclusion of Lemma [ID follows 
by comparing this protocol to the following communication lower bound: 

Lemma 12. The game above has nondeterministic (average-case) communication complexity Q(kM lg C). 

Proof. Let Xa and X R be the inputs of the two players. For any choice of Xa, there is a unique sequence 
of colorings X R that Bob should accept. As in the proof of Lemma |9l we conclude that the public proof is 
an encoding of X R so we can lower bound the complexity via H(JTb). 

Let J a and Jb be the columns touched (updated and queried) in Alice's input and in Bob's input. Bob's 
input consists of the coloring of column j, for each j € Jb- This is 7r<j applied to the fixed coloring in the 
first column. 

Since Ja and Jb are defined by the bit-reversal permutation, we know that they interleave perfectly: 
between every two values in the sorted order of Jb, there is a unique value in Ja- Thus, the coloring for 
different j G Jb are independent random variables, since an independent uniform permutation from Ja 
is composed into 7r<j compared to all indices from Jb below j. Each coloring is uniformly distributed 
among balanced colorings, so it has entropy M lg C — 0(C lg M) (c.f. proof of Claim©. We conclude that 
R(X B ) = n(kM\gC). □ 

5 Amortized link-find bounds 

The union-find problem has been studied into excruciating detail and is now essentially understood. From 
an amortized perspective, Tarjan [ 15 ] showed that a sequence of n— 1 unions and m finds can be supported in 
time 0(n + ma(m, n)). See ifTTlfTTl for different analyses and trade-offs between amortized running times. 
From a worst-case perspective, the classic union-by-rank gives union in constant time and find in O (log re) 
time. Trade-offs were addressed by Blum [6 ], with an improvement by Smid [14]. They show that, if the time 
for union is bounded by iuNioN, Find can be supported in worst-case 0(lg re/ lg £union)- Finally, Alstrup et 
al. HI showed that the amortized and worst-case trade-offs can be achieved simultaneously. These bounds 
are known to be optimal in the powerful cell-probe model (see below for a review of the lower bounds). 

A similar phenomenon appeared in connection with union-find with deletions. Kaplan et al. flOl con- 
sidered this problem but wanted bounds where n represented the size of the actual tree(s) involved in an 
operation. All worst-case bounds are trivially local, and iflOl proved refined the standard amortized analysis 
to work locally, though the bound becomes a bit weird with the standard notation: a(n) is OK, but other- 
wise, it becomes a(n ■ \M/N~\ , n) amortized time per find where M and N are the global number of finds 
and unions, respectively. With the notation from Q, the local amortized find bound is 0(a^M/N] ( n ))- They 
showed how to augment union-find with a delete operation if we when deleting an element x, first find the 
root and then perform a local rebuilding step in the tree that x is deleted from. For t\j = 0(1), this gave 
them both find-root and delete in 0(log n) time. Similar to our case, they asked if the deletion time could be 
made better than this find time. For the deletions, the answer was yes. Alstrup et al. proved that deletions 
could be supported locally in constant time without affecting the 0(log n) bound on the query time. 

Back to original In this section we consider the amortized complexity of the link-find problem which is 
like the union-find problem except that we can link arbitrary nodes, not just roots. In link-find, we may not 
necessarily have an obvious notion of a root that we can find. The fundamental requirement to a component 
is that if we call find from any vertex in it, we get the same root as long as the component is not linked with 
other components. 
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Let u be the number of updates and q the number of queries. With union-find, the complexity over the 
whole sequence is Q(a(q,u)q) if q > u, and ®(a(q,q)q + u) if q < u. With link-find, we get the same 
complexity when q > u, but a higher complexity of @(a(q, u)u) when q < u. Thus, with link-find, we get 
a symmetric formula in q and u of 

6(a(max{g, u}, mm{g, u}) max{q, u}). (5) 

We get the upper-bound in © via a very simple reduction to union-find. 

5.1 The link-find data structure 

Nodes have three types: free, leaf, and union nodes. A leaf node has a pointer to a neighboring union node, 
and the union nodes will participate in a standard union-find data structure. The parent of a leaf is the union 
node it points to. The parent of a union node is as in the union-find structure and the parent of a root is the 
root itself. 

All nodes start as free nodes. We preserve the invariant that if a component has a free node, then all 
nodes in the component are free. 

To perform a find on a free node v, we scan the component of v. If it is a singleton, we just return it. 
Otherwise, assuming some initial tie-breaking order, we make the smallest node in the component a union 
node and all other nodes leaf nodes pointing to is. The union node which is its own root is returned. All this 
is paid for by the nodes that lost their freedom. 

To perform a find on a non-free node, we perform it on the parent which is in the union-find data 
structure. 

We now consider the different types of links. When we perform link between two free nodes, nothing 
happens except that an edge is added in constant time. 

If we link a free node v with a non-free node w, we make all nodes in the components of v leaves 
pointing to the parent of w. This is paid for by the new leaves. 

If we link two non-free nodes, we first perform a find from their parents which are union nodes. If they 
have different roots we unite them. 

This completes the description of our link-find data structure which spends linear time reducing to a 
union-find data structure. A union node requires a find on a non-singleton node, so the number of union 
nodes is at most mm{q, u}. Concerning finds in the union-find data structure, we get one for each original 
find on a non-free node. In addition, we get two finds for each link of two non-free nodes, adding up to at 
most q + 2u finds. Our total complexity is therefore 

0(u + q + a(q + 2u, mm{q, u})(q + 2u)) 
= 0(a(max{(/, u}, min{q, u}) max{g, u}). 

We are going to present a matching lower bound. 

5.2 The link-find data structure for a forest 

We will now show that it is the links between nodes in the same components that makes link-find harder 
than union-find in the sense that if no such links appear, we get the same O-bound as with union-find. 

The modification to the above link-find reduction is simple. Using standard doubling ideas, we can 
assume that u and q are known in advance. If q > u, we are already matching the union-find bound, so 
assume q < u. 
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To do a find on a free node, we again scan its component. However, if it has less than a(q, q) nodes, we 
just return the smallest but leaving the component free. Otherwise, as before, we make the smallest node a 
union node and all other nodes leaf nodes pointing to it. This is the only change to our link-find algorithm. 

In the case where the component has a(q,q) nodes, we clearly pay only 0(a(q,q)) for a find. The 
advantage is that we now create at most u/a(q,q) union nodes. Links involving a free node have linear 
total cost, and now, when we perform a link of non-free nodes, we know they are from different components 
to be united, so this will reduce the number of union roots by one. Hence we get at most 2u/a(q, q) finds 
resulting from these links. Thus, in the union-find data structure, we end up with q + 2u/a(q, q) finds and 
u/a(q, q) unions. The total cost is 

0(u + q + a(q + 2u/a(q, q),u/a(q, q))(q + 2u/a(q, q)) 
= 0(a(q,q)q + n) 

time. The simplification uses that a is increasing in its first and decreasing in its second argument, and that 
the whole time bound is linear if q < u/a(q, q). 
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Appendix a. Lower Bounds for Amortized Link-Find 

We will now sketch a proof for the lower-bound in (|5) with u link updates and q find queries. When q > u, 
we get this from the union-find lower bound of Q,(a(q,u)q) from J8]. However, for q <C u, we need to 
prove a higher lower-bound than that for union-find. The lower bound we want in this case is Q(a(u, q)u). 

We would get the desired lower bound if we could code a union-find problem with £l(q) updates and 
Q(u) queries. We cannot make such a black-box reduction, but we can do it inside the proof construction 
from [8 ]. We will only present the idea in the "reduction". For a real proof one has to carefully examine the 
whole proof from [8] to verify that nothing really breaks. 

The lower bound construction from [8 1 proceeds in rounds. We start with singleton roots. In a union 
round, we pair all current roots randomly, thus halving the number of roots. In a find round, we perform a 
number of finds on random leaves. The number of finds are adjusted depending on the actions of the data 
structure. From [|9| we know that the lower bound also holds if the finds just have to verify the current root 
of a node. 

In our case, we will start with n roots. In a union-round, we just link roots as in union-find. However, in 
a find round, instead of calling find from a leaf v, we link v to its current root r. We want to turn this leaf -root 
link into a verification. We will not do that for the individual links, but we will do it for the find-round as a 
whole (one needs to verify that this batching preserves the lower-bound). At the end of the find-round, we 
simply perform a find on each root. All these finds should return the root itself. If one of the links (v, r) 
had gone to the wrong root and r' was the correct root, then r and r' would be connected in the same tree, 
which means that they cannot both be roots. One of the finds would therefore return a different root. If 
the union-find problem we code used / finds, then our link-find solution ends up with u = n — 1 + / link 
updates and q = n — 1 find verifications, hence with the desired lower bound of 

tt(a(f,n)f) = Q(a(u,q)u). 
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